An effective long-range attraction between protein molecules in solutions studied by 

small angle neutron scattering 
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Small angle neutron scattering intensity distributions taken from cytochrome C and lysozyme 
protein solutions show a rising intensity at very small wave vector, Q, which can be interpreted in 
terms of the presence of a weak long-range attraction between protein molecules. This interaction 
has a range several times that of the diameter of the protein molecule, much greater than the range 
of the screened electrostatic repulsion. We show evidence that this long-range attraction is closely 
related to the type of anion present and ion concentration in the solution. 
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The bottleneck of protein crystallography is the lack 
of systematic methods to obtain protein crystals. This 
is partly due to incomplete understanding of the physi- 
cal chemistry conditions controlling the growth of protein 
crystals. A full comprehension of the effective protein in- 
teractions and phase behavior is therefore essential. It 
has been shown that the crystallization curves of some 
globular proteins appear to coincide with the phase dia- 
grams of a hard sphere system interacting with a short 
range attraction 0, 0, 13 ■ Small angle neutron and X-ray 
scattering investigations of proteins suggest the presence 
of a short-range attractive interaction between protein 
molecules besides the electrostatic repulsion induced by 
the residual charges 1,1113. The DLVO potential has 
been successfully applied to many colloidal systems and 
protein solutions However, it does not seem to fully 

explain the rich behaviour of proteins 0,0,0.13; and due 
to the complexity of these systems (anisotropic property, 
irregular shape, distributed charge patches, etc.), a com- 
plete understanding of the properties of the effective in- 
teractions between protein molecules in solutions remains 
a challenge |1| . 

Recent measurements of small angle neutron scattering 
(SANS) intensity distribution in protein solutions show 
interesting results d 0, Beside the normal first 

diffraction peak, it is present a peak (cluster peak) ap- 
pearing at a much smaller scattering wave vector, Q, due 
to the formation of ordered clusters. The appearance 
of a cluster peak is explained as due to the competi- 
tion of a short-range attraction and a long-range elec- 
trostatic repulsion [1 0, ^|. Moreover, a rising in- 
tensity as Q approaches zero (zero-Q peak) is observed 
in both liquid-like and solid-like samples, which implies 
that the effective potential should have more features in 
addition to the well known short-range attraction and 
electrostatic repulsion. The existence of a long-range 
attraction between protein molecules has already been 
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postulated in theoretical studies. Noro et al. suggested 
that the presence of a long-range attractive force between 
protein molecules should shift the metastable fluid-fluid 
critical point out of gel regime |13| | . Thus a protein crys- 
tallization may occur without gelation. By employing 
two Yukawa potential model, Lawlor et al. showed that 
the introduction of a long-range attraction between pro- 
tein molecules can enhance crystal growth by avoiding 
the formation of a disordered state, an attractive glass 
0, 0|, while preserving the equilibrium features of a 
system with the short-range attraction [l^ . 

In this paper, by systematically studying the zero-Q 
peak, we find that a weak long-range attraction needs to 
be considered to explain the SANS scattering intensity 
distributions. The properties of this long-range attrac- 
tion potential are also investigated. Cytochrome C from 
horse heart (product no. C7752) was purchased from 
Sigma and is obtained using a procedure that avoids the 
trichloroacetic acid (TCA) that is known to promote the 
dimer formation. Cytochrome C has been dialyzed three 
times in order to remove any extra salt. Buffers are not 
used to avoid possible bindings of organic molecules to 
the protein surface. Lysozyme from chicken egg white 
was purchased from Fluka (product no. L7651) and used 
without further purification. The pD of cytochrome solu- 
tions at 7.2 and 9.5, has been adjusted by a HCl standard 
solution. Only samples of Lysozyme have been prepared 
in 20 niM HEPES buffer. The pD value was sequentially 
checked by an ISFET pHMeter (KS723) before and after 
each performed experiment and found stable within 0.1 
units. All samples were prepared a few days before the 
scheduled experiments to allow the hydrogen/deuterium 
exchange. For all solutions the final protein concentra- 
tion has been measured by UV- Visible spectroscopy. Our 
experiments were performed at the small angle neutron 
scattering station, NG7, at the Center of Neutron Re- 
search in the National Institute of Standard and Technol- 
ogy. Two configurations have been used to reach a wide 
range of the wave vector, Q, from G.004A~^ to 0.30A~^, 
where Q — ^sin{9/2), A is the neutron wavelength, 6 
the neutron scattering angle. All the analyses have taken 
into account the instrumental resolution correction. 
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FIG. 1: Theoretical calculations of I{Q) resulting from one 
Yukawa attraction at 1% volume fraction, a) the effect due 
to the variation of the attraction strength, K at Z — 10. b) 
the effect due to the variation of the attraction range, 1/Z at 
K — 0.5. The inset shows I{Q) from a cytochrome C sample 
at 1% volume fraction in IM NaCl at pD=ll. 




FIG. 2: SANS intensity distribution of cytochrome C solu- 
tions at pD=ll and at 1% volume fraction with different salts 
added. The solid lines are the theoretical analyses. 



SANS intensity distribution, /(Q), can be expressed 
as I{Q) = AP[Q)S{Q), where P(Q) is the normaHzed 
particle structure factor, S{Q), the inter-particle struc- 
ture factor, A, the known amphtude factor which is pro- 
portional to the volume fraction and the square of the 
neutron scattering length contrast between protein and 
solvent ^3 • Cytochrome C has an ellipsoidal shape with 
the semi-major and minor axes, axbxb — 15xl7x 17 A^, 
while a lysozyme molecule has a dimension axbxb — 
22.5 X 15 X 15A^. P{Q) is calculated by considering the 
ellipsoidal shape of the protein. S{Q) is calculated by 
solving the Ornstein-Zernike (OZ) equation within the 
mean spherical approximation (MSA) closure involving 
an effective pair potential, V{r), to be specified later. 
The effect on S{Q) due to the non-spherical shape of a 
particle is approximately taken into account by using the 
decoupling approximation [l^ . 

Figure 1 shows theoretical calculations together with 
the SANS result from a cytchrome C solution at pD — 11 
with 1% volume fraction in IM NaCl in the inset. 
Conventionally, the effective inter-protein potential is 
considered to consist of a short-range attractive and a 
long-range electrostatic repulsive part. The repulsion is 
screened out by the concentrated salt. Therefore, S{Q) 
could be obtained by solving OZ equation by consider- 
ing only a short-range attraction, which can be approxi- 
mated by an attractive potential of a Yukawa form [T^ . 
V{r) = —K- — ^- — ^, where K is normalized by fcsT, 
and r is the inter-protein distance normalized by a with 
a = 2{ab'^Y/^. The interaction range, is approxi- 
mately about 10% of the protein diameter Figure 
1(a) shows the theoretically calculated SANS intensity 
distribution with Z = 10 by taking the amplitude factor 
A equal to unity. As K increases from 0.5 to 16, the in- 
tensity at Q = increases gradually. The results from a 
short-range attraction always smoothly change the whole 



scattering curve and can not reproduce the sharp rising 
up of experimental SANS intensity at very low Q (zero-Q 
peak) as shown in the inset. Since the typical str eng th of 
short-range attraction is smaller than 10fcRT|3. l2fl[ l2ll|. 
the short-range attraction can not explain the observed 
SANS intensity distribution. In Figure 1(b), K is fixed 
at 0.5. When Z decreases from 10 to 0.5, i.e. the at- 
traction range increases, the intensity at low Q increases 
sharply and the theoretical curve exhibits a similar trend 
to the experimental intensity. The comparison between 
the experimental result and theoretical curves thus sug- 
gests that the zero-Q peak could be induced by a weak 
long-range attraction. Therefore, the effective potential 
between protein molecules in solutions should consists of 
three features: a short-range attraction, an intermediate- 
range electrostatic repulsion, and a weak long-range at- 
traction. This zero-Q peak has been overlooked in previ- 
ous experiments due to the limited Q range covered 0| . 
However, it has been observed before in lysozyme protein 
solutions ^1 and was explained as due to the long-range 
density fluctuation. 

To investigate the properties of this long-range attrac- 
tion, three different cytochrome C samples at pD=ll 
with 1% volume fraction added with different salts were 
measured and their SANS intensity distributions with 
error bars are plotted in semi-log scale to clearly show 
zero-Q peaks in Figure 2. The salt concentration, which 
is indicated in the figure, has been changed to keep 
the same ionic strength. Very interestingly, the results 
show that the zero-Q peak depends on the different an- 
ions added. NaCl induces a much weaker zero-Q peak 
compared with that of NaSCN and Na2SOi. Since 
the true form of this long-range attraction is still un- 
known, in order to fit the experimental results, we assume 
that the effective inter-protein potential can be simu- 
lated by two attractive Yukawa form potential, Vty {f) — 
—Ki- — — ^ — K2- — — The first term is used to 
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TABLE I: Fitted parameters using the two attractive Yukawa 
form potential. Results are shown in Figure 2. 





a{X) 




Zx 




^2 


NaCl 


14.7 


7±3 


7±3 


0.11 ± 0.02 


0.20 ±0.01 


NaSCN 


14.6 


8±5 


10 ±6 


0.33 ± 0.02 


0.35 ±0.01 


NaiSOi 


14.7 


4±5 


10± 10 


0.24 ±0.02 


0.27 ±0.02 




0.01 0,1 
Q(A-1) 



FIG. 3: SANS intensity distributions of cytochrome C sam- 
ples at 5% volume fraction. The solid lines are theoretical 
analyses using two Yukawa potential. 



simulate the short-range attraction, while the other one 
is used to simulate the long-range attraction. The fitted 
results are given in table I. 

The fitting is not sensitive at all to the short-range at- 
traction. The long-range attraction has a weak strength 
(less than 0.5 ksT) and a range of 3 ~ 5 times the pro- 
tein diameter, and it is very sensitive to the anions added 
to the solutions. This implies that the long-range attrac- 
tion is at least partly induced by the ion cloud around 
protein molecules. Depletion forces and van der Waals 
force can not explain this feature. It is very interesting 
to notice that the strength and range of the long-range 
attraction is about the same value as the attractive po- 
tential between like-charged particles ■ However, a 
charged colloidal particle typically has a uniform charge 
distribution on the surface, while a protein molecule has 
both positive and negative charge patches, which make 
the ion cloud distribution around a protein much more 
complicated. 

Since the long-range attraction seems to be induced by 
the ion cloud, if we can minimize the ion concentration in 
solutions, we should expect to greatly suppress the zero- 
Q peak. Figure 3 shows SANS results of two cytochrome 
C samples at pD=7.2 and pD=9.5 with 5% volume frac- 
tion. Without adding any extra salt in solutions, the ion 
concentration is determined by the dissociated charges 
from proteins. The cytochrome C molecule has much 
smaller charge number at pD=9.5 than that at pD=7.2, 
being the isoelectric point, p/, about 10.2. This leads 
to a smaller ion concentration in solutions at pD=9.5. 
Correspondingly, the SANS intensity distribution shows 
a zero-Q peak weaker than that at pD=7.2. This implies 



TABLE IL Fitted parameters obtained using the two Yukawa 
potential. Results are shown in Figure 3. 







a(A) 


charge number 


K2 


Z2 


pD 


= 7.2 


15.3 


3.8 ±0.1 


0.37 ±0.02 


0.34 ±0.01 


pD 


= 9.5 


15.4 


1.7 ±0.1 


0.08 ±0.01 


0.21 ±0.01 



that the zero-Q peak depends on the ion concentrations, 
and is not likely due to permanent cluster formations, 
since a weaker repulsion should favor larger cluster for- 
mations and thus induce larger zero-Q peak at pD=9.5, 
which is in contrast with our experimental results. In or- 
der to quantitatively analyze the zero-Q peak, we again 
used the two- Yukawa potential model. However, due to 
the existence of the weak long-range attraction, we need 
three Yukawa terms to completely simulate the features 
of the potential. We argue that in cytochrome C solu- 
tions, the short-range attraction is small so that its ef- 
fect on the structure factor should be very small at the 
relatively low volume fraction. Actually, the first diffrac- 
tion peak of SANS intensity distribution in cytochrome 
C solutions has been successfully analyzed by only con- 
sidering the electrostatic repulsion at various pD values 
[T'2> 17]. Furthermore, for cytochrome C solutions, we 
did not observe the cluster peak as it is observed for 
lysozyme solutions. The lack of cluster peak is attributed 
to the weak short-range attraction. Therefore, we ignore 
the short-range attraction and we use the second term of 
Vty for the long-range attraction contribution and the 
first term of Vty to simulate the electrostatic repulsion, 
which can be calculated by the charge number of the pro- 
tein molecule and the ionic strength 0, 0| . The fitted 
results given in table II show that the range of the long- 
range attraction is about 3^5 times of protein diameter, 
and also confirm our direct observation from Figure 3, i.e. 
the strength of long-range attraction at pD=9.5 is much 
smaller. 

After showing the existence of the long-range attrac- 
tion in solutions dominated by monomers, it is important 
to check its existence in protein solutions having equilib- 
rium clusters. Figure 4 shows the results of lysozyme 
protein solutions at pD=5.1 and at 10% and 20% vol- 
ume fraction in HEPES buffer. The top panel shows 
SANS intensity distributions with the fitted results and 
the bottom panel shows the calculated S{Q) from the 
fitted parameters. The main peak in the top panel is a 
cluster peak as it is independent of volume fraction. In- 
terestingly, even in the presence of cluster peak, both 
results have shown the zero-Q peak, which was over- 
looked by previous experiments 5]. The coexistence of 
the cluster peak and zero-Q peak clearly indicates the 
necessity of introducing a third potential feature, a long- 
range attraction. We used the two Yukawa model previ- 
ously described to fit the main peak, assuming that the 
first term is the short range attraction, and the second 
term the electrostatic repulsion. The fitted curves are 
shown as dotted line in the figure and the fitting results 
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FIG. 4: SANS intensity distributions of Iysozyme samples at 
pD=5.1 with 20 mM HEPES buflFer. Dotted lines (a and b) 
are fitted by considering only a short-range attraction and 
electrostatic repulsion. Solid lines (a and b) are the fitted 
results by using three Yukawa form potential. Lower panel 
shows the calculated structure factor, S{Q). 



TABLE III: Fitted parameters from fitting the cluster peaks 
by using only a short-range attraction and the electrostatic 
repulsion (see Figure 4). 





a(A) 


charge number 


Ki 


Zi 


10% 


21.1 


7.0 ± 1.2 


6.9 ±1.0 


14 ±5 


20% 


21.2 


6.8 ±1.5 


7.7 ± 1.2 


15 ±5 



are given in table III. The attraction strength is about 
IksT with 7% of attraction range, which is consistent 
with Ref.0|. In order to fit the complete SANS dis- 
tribution, we use the two Yukawa model as the reference 
system and treat the long-range attraction as a perturba- 
tion by employing the random phase approximation [24j . 
The long-range attraction is considered here as a third 



Yukawa form, —K3- — — — -. The results fitted with this 
approximation are shown as solid line. The addition of 
this third potential feature slightly changes the previous 
fitting parameters. The parameters for the long-range 
attraction are, K3 = 0.038 ± 0.005 and Z3 = 0.17 ± 0.01 
for the 10% Iysozyme sample, while for 20% sample, 
K3 ^ 0.019 ± 0.002 and Z3 ^ 0.21 ± 0.03. 

Judging from SANS results on both cytochrome C and 
Iysozyme protein solutions, we believe that the existence 
of weak long-range attraction is universal for all pro- 
tein solutions. Its strength and range depend both on 
the type of anion and ion concentration in solutions. In 
other words, it depends on the ion cloud around protein 
molecules. Our finding has important implications in un- 
derstanding the protein crystallization process. There 
are numerous measurements of second virial coefficient 
in protein solutions by static light scattering experiments 
[25, 26]. Due to the very small Q range the light scatter- 
ing can access, its intensity is closely related to the height 
of the zero-Q peak, which depends on the feature of the 
long-range attraction. Thus we believe that the conclu- 
sions derived from those observations should take into 
account the existence of the weak long-range attraction. 
The charge dependence of some of these phenomena can 
then be naturally related to the charge dependence of the 
long-range attraction 0, . Also the very large cluster 
formation or gelation in supersaturated protein solutions 
may be induced by the long-range attraction [2M |29| . 
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